Dataset statistics
| Number of variables | 24 |
|---|---|
| Number of observations | 41221 |
| Missing cells | 0 |
| Missing cells (%) | 0.0% |
| Duplicate rows | 0 |
| Duplicate rows (%) | 0.0% |
| Total size in memory | 7.5 MiB |
| Average record size in memory | 192.0 B |
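The two memory figures above can be cross-checked with simple arithmetic. The sketch below assumes that each of the 24 columns stores one 8-byte value per row (a 64-bit number or an object pointer); that assumption is not stated in the report, it is only the simplest one that reproduces the reported numbers.

```python
# Back-of-the-envelope check of the memory figures in the table above.
# Assumption (not stated in the report): every column holds one 8-byte
# value per row, either a 64-bit number or an object pointer.
N_ROWS = 41221
N_COLS = 24
BYTES_PER_VALUE = 8  # assumed

record_bytes = N_COLS * BYTES_PER_VALUE      # bytes per row
total_mib = N_ROWS * record_bytes / 2**20    # bytes converted to MiB

print(f"record size: {record_bytes} B")
print(f"total size:  {total_mib:.1f} MiB")
```

Under that assumption the estimate matches the reported 192.0 B per record and 7.5 MiB in total.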
Variable types
| Numeric | 12 |
|---|---|
| Categorical | 12 |
Warnings
int_rate has a high cardinality: 394 distinct values | High cardinality |
earliest_cr_line has a high cardinality: 519 distinct values | High cardinality |
revol_util has a high cardinality: 1116 distinct values | High cardinality |
last_credit_pull_d has a high cardinality: 108 distinct values | High cardinality |
loan_amnt is highly correlated with installment | High correlation |
installment is highly correlated with loan_amnt | High correlation |
open_acc is highly correlated with total_acc | High correlation |
total_acc is highly correlated with open_acc | High correlation |
fico_average is highly correlated with grade | High correlation |
inq_last_6mths is highly correlated with df_index | High correlation |
grade is highly correlated with fico_average | High correlation |
loan_status is highly correlated with df_index | High correlation |
df_index is highly correlated with inq_last_6mths and 1 other fields | High correlation |
annual_inc is highly skewed (γ1 = 29.30357761) | Skewed |
df_index is uniformly distributed | Uniform |
df_index has unique values | Unique |
delinq_2yrs has 36619 (88.8%) zeros | Zeros |
inq_last_6mths has 19037 (46.2%) zeros | Zeros |
pub_rec has 38988 (94.6%) zeros | Zeros |
revol_bal has 997 (2.4%) zeros | Zeros |
Reproduction
| Analysis started | 2021-08-10 18:29:52.627874 |
|---|---|
| Analysis finished | 2021-08-10 18:31:05.658475 |
| Duration | 1 minute and 13.03 seconds |
| Software version | pandas-profiling v3.0.0 |
| Download configuration | config.json |
df_index
Numeric
| Distinct | 41221 |
|---|---|
| Distinct (%) | 100.0% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 21312.42469 |
| Minimum | 0 |
|---|---|
| Maximum | 42449 |
| Zeros | 1 |
| Zeros (%) | < 0.1% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 322.2 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 2116 |
| Q1 | 10682 |
| median | 21366 |
| Q3 | 31990 |
| 95-th percentile | 40368 |
| Maximum | 42449 |
| Range | 42449 |
| Interquartile range (IQR) | 21308 |
Descriptive statistics
| Standard deviation | 12269.89789 |
|---|---|
| Coefficient of variation (CV) | 0.5757157183 |
| Kurtosis | -1.203348579 |
| Mean | 21312.42469 |
| Median Absolute Deviation (MAD) | 10655 |
| Skewness | -0.01091485817 |
| Sum | 878519458 |
| Variance | 150550394.1 |
| Monotonicity | Strictly increasing |
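Several rows in the quantile and descriptive tables above are derived from other rows. As a rough consistency check, the sketch below recomputes the range, the interquartile range, the coefficient of variation, and the variance from values copied out of those tables. Small differences in the last digits are expected because the inputs are already rounded.

```python
# Values copied from the quantile and descriptive statistics tables above.
minimum, maximum = 0, 42449
q1, q3 = 10682, 31990
mean = 21312.42469
std = 12269.89789

value_range = maximum - minimum   # Range
iqr = q3 - q1                     # Interquartile range (IQR)
cv = std / mean                   # Coefficient of variation (CV)
variance = std ** 2               # Variance

print(value_range, iqr)
print(f"CV = {cv:.6f}, variance = {variance:.1f}")
```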
| Value | Count | Frequency (%) |
| 2047 | 1 | < 0.1% |
| 42398 | 1 | < 0.1% |
| 36219 | 1 | < 0.1% |
| 34170 | 1 | < 0.1% |
| 40313 | 1 | < 0.1% |
| 38264 | 1 | < 0.1% |
| 11631 | 1 | < 0.1% |
| 9582 | 1 | < 0.1% |
| 15725 | 1 | < 0.1% |
| 13676 | 1 | < 0.1% |
| Other values (41211) | 41211 |
| Value | Count | Frequency (%) |
| 0 | 1 | |
| 1 | 1 | |
| 2 | 1 | |
| 3 | 1 | |
| 4 | 1 | |
| 5 | 1 | |
| 6 | 1 | |
| 7 | 1 | |
| 8 | 1 | |
| 9 | 1 |
| Value | Count | Frequency (%) |
| 42449 | 1 | |
| 42448 | 1 | |
| 42446 | 1 | |
| 42445 | 1 | |
| 42444 | 1 | |
| 42443 | 1 | |
| 42442 | 1 | |
| 42441 | 1 | |
| 42440 | 1 | |
| 42439 | 1 |
loan_amnt
Numeric
| Distinct | 893 |
|---|---|
| Distinct (%) | 2.2% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 11184.61888 |
| Minimum | 500 |
|---|---|
| Maximum | 35000 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 322.2 KiB |
Quantile statistics
| Minimum | 500 |
|---|---|
| 5-th percentile | 2400 |
| Q1 | 5500 |
| median | 10000 |
| Q3 | 15000 |
| 95-th percentile | 25000 |
| Maximum | 35000 |
| Range | 34500 |
| Interquartile range (IQR) | 9500 |
Descriptive statistics
| Standard deviation | 7417.438565 |
|---|---|
| Coefficient of variation (CV) | 0.6631820576 |
| Kurtosis | 0.7585805103 |
| Mean | 11184.61888 |
| Median Absolute Deviation (MAD) | 5000 |
| Skewness | 1.054145051 |
| Sum | 461041175 |
| Variance | 55018394.86 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 10000 | 2940 | 7.1% |
| 12000 | 2393 | 5.8% |
| 5000 | 2150 | 5.2% |
| 6000 | 1977 | 4.8% |
| 15000 | 1977 | 4.8% |
| 20000 | 1696 | 4.1% |
| 8000 | 1646 | 4.0% |
| 25000 | 1475 | 3.6% |
| 4000 | 1185 | 2.9% |
| 3000 | 1074 | 2.6% |
| Other values (883) | 22708 |
| Value | Count | Frequency (%) |
| 500 | 11 | |
| 550 | 1 | < 0.1% |
| 600 | 6 | |
| 700 | 2 | < 0.1% |
| 725 | 1 | < 0.1% |
| 750 | 1 | < 0.1% |
| 800 | 3 | < 0.1% |
| 850 | 1 | < 0.1% |
| 900 | 4 | < 0.1% |
| 925 | 1 | < 0.1% |
| Value | Count | Frequency (%) |
| 35000 | 675 | |
| 34800 | 2 | < 0.1% |
| 34675 | 1 | < 0.1% |
| 34525 | 1 | < 0.1% |
| 34475 | 5 | < 0.1% |
| 34200 | 1 | < 0.1% |
| 34000 | 15 | < 0.1% |
| 33950 | 7 | < 0.1% |
| 33600 | 6 | < 0.1% |
| 33500 | 2 | < 0.1% |
term
Categorical
| Distinct | 2 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 322.2 KiB |
| 36 months | |
|---|---|
| 60 months |
Length
| Max length | 10 |
|---|---|
| Median length | 10 |
| Mean length | 10 |
| Min length | 10 |
Characters and Unicode
| Total characters | 412210 |
|---|---|
| Distinct characters | 10 |
| Distinct categories | 3 |
| Distinct scripts | 2 |
| Distinct blocks | 1 |
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 36 months |
|---|---|
| 2nd row | 60 months |
| 3rd row | 36 months |
| 4th row | 36 months |
| 5th row | 60 months |
Common Values
| Value | Count | Frequency (%) |
|---|---|---|
| 36 months | 30501 | 74.0% |
| 60 months | 10720 | 26.0% |
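Because `term` is stored as text, it usually has to be converted before it can be used as a number. The length statistics above (every value is 10 characters long and contains two spaces) suggest that the labels carry an extra, presumably leading, space, as in `" 36 months"`. The helper below is a minimal sketch under that assumption; `parse_term` is a hypothetical name, not part of the report or of any library.

```python
def parse_term(term: str) -> int:
    """Convert a term label such as ' 36 months' to an integer month count."""
    # str.split() with no arguments ignores leading and repeated whitespace.
    number, unit = term.split()
    if unit != "months":
        raise ValueError(f"unexpected unit in {term!r}")
    return int(number)


# Sample labels, written with the assumed leading space.
samples = [" 36 months", " 60 months"]
print([parse_term(s) for s in samples])
```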
Words
| Value | Count | Frequency (%) |
| months | 41221 | |
| 36 | 30501 | |
| 60 | 10720 | 13.0% |
Most occurring characters
| Value | Count | Frequency (%) |
| (space) | 82442 | |
| 6 | 41221 | |
| m | 41221 | |
| o | 41221 | |
| n | 41221 | |
| t | 41221 | |
| h | 41221 | |
| s | 41221 | |
| 3 | 30501 | 7.4% |
| 0 | 10720 | 2.6% |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 247326 | |
| Space Separator | 82442 | 20.0% |
| Decimal Number | 82442 | 20.0% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| m | 41221 | |
| o | 41221 | |
| n | 41221 | |
| t | 41221 | |
| h | 41221 | |
| s | 41221 |
Decimal Number
| Value | Count | Frequency (%) |
| 6 | 41221 | |
| 3 | 30501 | |
| 0 | 10720 | 13.0% |
Space Separator
| Value | Count | Frequency (%) |
| (space) | 82442 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 247326 | |
| Common | 164884 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| m | 41221 | |
| o | 41221 | |
| n | 41221 | |
| t | 41221 | |
| h | 41221 | |
| s | 41221 |
Common
| Value | Count | Frequency (%) |
| (space) | 82442 | |
| 6 | 41221 | |
| 3 | 30501 | 18.5% |
| 0 | 10720 | 6.5% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 412210 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| (space) | 82442 | |
| 6 | 41221 | |
| m | 41221 | |
| o | 41221 | |
| n | 41221 | |
| t | 41221 | |
| h | 41221 | |
| s | 41221 | |
| 3 | 30501 | 7.4% |
| 0 | 10720 | 2.6% |
int_rate
Categorical
| Distinct | 394 |
|---|---|
| Distinct (%) | 1.0% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 322.2 KiB |
| 10.99% | 946 |
|---|---|
| 13.49% | 818 |
| 11.49% | 812 |
| 7.51% | 756 |
| 7.88% | 715 |
| Other values (389) |
Length
| Max length | 7 |
|---|---|
| Median length | 7 |
| Mean length | 7 |
| Min length | 7 |
Characters and Unicode
| Total characters | 288547 |
|---|---|
| Distinct characters | 13 |
| Distinct categories | 3 |
| Distinct scripts | 1 |
| Distinct blocks | 1 |
Unique
| Unique | 15 |
|---|---|
| Unique (%) | < 0.1% |
Sample
| 1st row | 10.65% |
|---|---|
| 2nd row | 15.27% |
| 3rd row | 15.96% |
| 4th row | 13.49% |
| 5th row | 12.69% |
Common Values
| Value | Count | Frequency (%) |
|---|---|---|
| 10.99% | 946 | 2.3% |
| 13.49% | 818 | 2.0% |
| 11.49% | 812 | 2.0% |
| 7.51% | 756 | 1.8% |
| 7.88% | 715 | 1.7% |
| 7.49% | 633 | 1.5% |
| 11.71% | 591 | 1.4% |
| 9.99% | 587 | 1.4% |
| 7.90% | 559 | 1.4% |
| 5.42% | 524 | 1.3% |
| Other values (384) | 34280 | 83.2% |
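The interest rate is profiled as a high-cardinality categorical variable because it is stored as text with a trailing `%`. The fixed length of 7 and the space counts above also suggest left padding with spaces. A minimal sketch of a conversion to a numeric fraction follows; `parse_rate` is a hypothetical helper and the padded sample strings are assumptions based on those length statistics.

```python
def parse_rate(rate: str) -> float:
    """Convert a percentage string such as ' 10.99%' to a fraction (0.1099)."""
    text = rate.strip()                      # drop any padding spaces
    if not text.endswith("%"):
        raise ValueError(f"expected a trailing '%' in {rate!r}")
    return float(text[:-1]) / 100.0


# Sample values, padded to 7 characters as the length statistics suggest.
for raw in [" 10.99%", "  7.51%"]:
    print(repr(raw), "->", parse_rate(raw))
```

Once converted, the column can be summarized with the same quantile and descriptive statistics as the other numeric variables.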
Words
| Value | Count | Frequency (%) |
| 10.99 | 946 | 2.3% |
| 13.49 | 818 | 2.0% |
| 11.49 | 812 | 2.0% |
| 7.51 | 756 | 1.8% |
| 7.88 | 715 | 1.7% |
| 7.49 | 633 | 1.5% |
| 11.71 | 591 | 1.4% |
| 9.99 | 587 | 1.4% |
| 7.90 | 559 | 1.4% |
| 5.42 | 524 | 1.3% |
| Other values (384) | 34280 |
Most occurring characters
| Value | Count | Frequency (%) |
| (space) | 53007 | |
| . | 41221 | |
| % | 41221 | |
| 1 | 40505 | |
| 9 | 21898 | |
| 2 | 13314 | 4.6% |
| 6 | 12450 | 4.3% |
| 7 | 12400 | 4.3% |
| 4 | 11683 | 4.0% |
| 3 | 10668 | 3.7% |
| Other values (3) | 30180 |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 153098 | |
| Other Punctuation | 82442 | |
| Space Separator | 53007 | 18.4% |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 1 | 40505 | |
| 9 | 21898 | |
| 2 | 13314 | 8.7% |
| 6 | 12450 | 8.1% |
| 7 | 12400 | 8.1% |
| 4 | 11683 | 7.6% |
| 3 | 10668 | 7.0% |
| 5 | 10561 | 6.9% |
| 8 | 10032 | 6.6% |
| 0 | 9587 | 6.3% |
Other Punctuation
| Value | Count | Frequency (%) |
| . | 41221 | |
| % | 41221 |
Space Separator
| Value | Count | Frequency (%) |
| (space) | 53007 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 288547 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| (space) | 53007 | |
| . | 41221 | |
| % | 41221 | |
| 1 | 40505 | |
| 9 | 21898 | |
| 2 | 13314 | 4.6% |
| 6 | 12450 | 4.3% |
| 7 | 12400 | 4.3% |
| 4 | 11683 | 4.0% |
| 3 | 10668 | 3.7% |
| Other values (3) | 30180 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 288547 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| (space) | 53007 | |
| . | 41221 | |
| % | 41221 | |
| 1 | 40505 | |
| 9 | 21898 | |
| 2 | 13314 | 4.6% |
| 6 | 12450 | 4.3% |
| 7 | 12400 | 4.3% |
| 4 | 11683 | 4.0% |
| 3 | 10668 | 3.7% |
| Other values (3) | 30180 |
installment
Numeric
| Distinct | 16128 |
|---|---|
| Distinct (%) | 39.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 325.4442859 |
| Minimum | 15.67 |
|---|---|
| Maximum | 1305.19 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 322.2 KiB |
Quantile statistics
| Minimum | 15.67 |
|---|---|
| 5-th percentile | 71.76 |
| Q1 | 167.57 |
| median | 280.82 |
| Q3 | 432.26 |
| 95-th percentile | 768.19 |
| Maximum | 1305.19 |
| Range | 1289.52 |
| Interquartile range (IQR) | 264.69 |
Descriptive statistics
| Standard deviation | 209.2316031 |
|---|---|
| Coefficient of variation (CV) | 0.6429106675 |
| Kurtosis | 1.171865356 |
| Mean | 325.4442859 |
| Median Absolute Deviation (MAD) | 123.45 |
| Skewness | 1.114593785 |
| Sum | 13415138.91 |
| Variance | 43777.86373 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 311.11 | 68 | 0.2% |
| 311.02 | 54 | 0.1% |
| 180.96 | 53 | 0.1% |
| 150.8 | 46 | 0.1% |
| 368.45 | 45 | 0.1% |
| 372.12 | 44 | 0.1% |
| 317.72 | 42 | 0.1% |
| 339.31 | 42 | 0.1% |
| 330.76 | 42 | 0.1% |
| 186.61 | 41 | 0.1% |
| Other values (16118) | 40744 |
| Value | Count | Frequency (%) |
| 15.67 | 1 | |
| 15.69 | 1 | |
| 15.75 | 1 | |
| 15.76 | 1 | |
| 15.91 | 1 | |
| 16.08 | 1 | |
| 16.25 | 1 | |
| 16.31 | 1 | |
| 16.47 | 1 | |
| 16.73 | 1 |
| Value | Count | Frequency (%) |
| 1305.19 | 1 | < 0.1% |
| 1302.69 | 1 | < 0.1% |
| 1295.21 | 1 | < 0.1% |
| 1288.1 | 2 | |
| 1283.5 | 1 | < 0.1% |
| 1276.6 | 3 | |
| 1272.2 | 1 | < 0.1% |
| 1269.73 | 4 | |
| 1265.16 | 1 | < 0.1% |
| 1263.23 | 1 | < 0.1% |
grade
Categorical
| Distinct | 7 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 322.2 KiB |
| B | |
|---|---|
| A | |
| C | |
| D | |
| E | |
| Other values (2) |
Length
| Max length | 1 |
|---|---|
| Median length | 1 |
| Mean length | 1 |
| Min length | 1 |
Characters and Unicode
| Total characters | 41221 |
|---|---|
| Distinct characters | 7 |
| Distinct categories | 1 |
| Distinct scripts | 1 |
| Distinct blocks | 1 |
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | B |
|---|---|
| 2nd row | C |
| 3rd row | C |
| 4th row | C |
| 5th row | B |
Common Values
| Value | Count | Frequency (%) |
|---|---|---|
| B | 12012 | 29.1% |
| A | 9753 | 23.7% |
| C | 8519 | 20.7% |
| D | 5861 | 14.2% |
| E | 3314 | 8.0% |
| F | 1262 | 3.1% |
| G | 500 | 1.2% |
Words
| Value | Count | Frequency (%) |
| b | 12012 | |
| a | 9753 | |
| c | 8519 | |
| d | 5861 | |
| e | 3314 | 8.0% |
| f | 1262 | 3.1% |
| g | 500 | 1.2% |
Most occurring characters
| Value | Count | Frequency (%) |
| B | 12012 | |
| A | 9753 | |
| C | 8519 | |
| D | 5861 | |
| E | 3314 | 8.0% |
| F | 1262 | 3.1% |
| G | 500 | 1.2% |
Most occurring categories
| Value | Count | Frequency (%) |
| Uppercase Letter | 41221 |
Most frequent character per category
Uppercase Letter
| Value | Count | Frequency (%) |
| B | 12012 | |
| A | 9753 | |
| C | 8519 | |
| D | 5861 | |
| E | 3314 | 8.0% |
| F | 1262 | 3.1% |
| G | 500 | 1.2% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 41221 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| B | 12012 | |
| A | 9753 | |
| C | 8519 | |
| D | 5861 | |
| E | 3314 | 8.0% |
| F | 1262 | 3.1% |
| G | 500 | 1.2% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 41221 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| B | 12012 | |
| A | 9753 | |
| C | 8519 | |
| D | 5861 | |
| E | 3314 | 8.0% |
| F | 1262 | 3.1% |
| G | 500 | 1.2% |
emp_length
Categorical
| Distinct | 11 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 322.2 KiB |
| 10+ years | |
|---|---|
| < 1 year | |
| 2 years | |
| 3 years | |
| 4 years | |
| Other values (6) |
Length
| Max length | 9 |
|---|---|
| Median length | 7 |
| Mean length | 7.488658693 |
| Min length | 6 |
Characters and Unicode
| Total characters | 308690 |
|---|---|
| Distinct characters | 18 |
| Distinct categories | 4 |
| Distinct scripts | 2 |
| Distinct blocks | 1 |
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 10+ years |
|---|---|
| 2nd row | < 1 year |
| 3rd row | 10+ years |
| 4th row | 10+ years |
| 5th row | 1 year |
Common Values
| Value | Count | Frequency (%) |
|---|---|---|
| 10+ years | 9356 | 22.7% |
| < 1 year | 4993 | 12.1% |
| 2 years | 4719 | 11.4% |
| 3 years | 4349 | 10.6% |
| 4 years | 3635 | 8.8% |
| 1 year | 3562 | 8.6% |
| 5 years | 3449 | 8.4% |
| 6 years | 2368 | 5.7% |
| 7 years | 1868 | 4.5% |
| 8 years | 1586 | 3.8% |
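The employment-length labels encode an ordered quantity, so they are often mapped to whole years before modeling. The sketch below shows one possible mapping. Treating `< 1 year` as 0 and `10+ years` as 10 is an assumed convention, and `parse_emp_length` is a hypothetical helper name.

```python
import re


def parse_emp_length(label: str) -> int:
    """Map an employment-length label to whole years.

    Assumed conventions: '< 1 year' -> 0 and '10+ years' -> 10.
    """
    text = label.strip()
    if text.startswith("<"):
        return 0
    match = re.match(r"(\d+)", text)   # leading digits, e.g. '10' in '10+ years'
    if match is None:
        raise ValueError(f"unrecognized label {label!r}")
    return int(match.group(1))


for label in ["10+ years", "< 1 year", "1 year", "4 years"]:
    print(label, "->", parse_emp_length(label))
```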
Words
| Value | Count | Frequency (%) |
| years | 32666 | |
| 10 | 9356 | 10.7% |
| year | 8555 | 9.8% |
| 1 | 8555 | 9.8% |
| (empty) | 4993 | 5.7% |
| 2 | 4719 | 5.4% |
| 3 | 4349 | 5.0% |
| 4 | 3635 | 4.2% |
| 5 | 3449 | 3.9% |
| 6 | 2368 | 2.7% |
| Other values (3) | 4790 | 5.5% |
Most occurring characters
| Value | Count | Frequency (%) |
| (space) | 46214 | |
| y | 41221 | |
| e | 41221 | |
| a | 41221 | |
| r | 41221 | |
| s | 32666 | |
| 1 | 17911 | 5.8% |
| 0 | 9356 | 3.0% |
| + | 9356 | 3.0% |
| < | 4993 | 1.6% |
| Other values (8) | 23310 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 197550 | |
| Decimal Number | 50577 | 16.4% |
| Space Separator | 46214 | 15.0% |
| Math Symbol | 14349 | 4.6% |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 1 | 17911 | |
| 0 | 9356 | |
| 2 | 4719 | 9.3% |
| 3 | 4349 | 8.6% |
| 4 | 3635 | 7.2% |
| 5 | 3449 | 6.8% |
| 6 | 2368 | 4.7% |
| 7 | 1868 | 3.7% |
| 8 | 1586 | 3.1% |
| 9 | 1336 | 2.6% |
Lowercase Letter
| Value | Count | Frequency (%) |
| y | 41221 | |
| e | 41221 | |
| a | 41221 | |
| r | 41221 | |
| s | 32666 |
Math Symbol
| Value | Count | Frequency (%) |
| + | 9356 | |
| < | 4993 |
Space Separator
| Value | Count | Frequency (%) |
| (space) | 46214 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 197550 | |
| Common | 111140 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| (space) | 46214 | |
| 1 | 17911 | 16.1% |
| 0 | 9356 | 8.4% |
| + | 9356 | 8.4% |
| < | 4993 | 4.5% |
| 2 | 4719 | 4.2% |
| 3 | 4349 | 3.9% |
| 4 | 3635 | 3.3% |
| 5 | 3449 | 3.1% |
| 6 | 2368 | 2.1% |
| Other values (3) | 4790 | 4.3% |
Latin
| Value | Count | Frequency (%) |
| y | 41221 | |
| e | 41221 | |
| a | 41221 | |
| r | 41221 | |
| s | 32666 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 308690 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| (space) | 46214 | |
| y | 41221 | |
| e | 41221 | |
| a | 41221 | |
| r | 41221 | |
| s | 32666 | |
| 1 | 17911 | 5.8% |
| 0 | 9356 | 3.0% |
| + | 9356 | 3.0% |
| < | 4993 | 1.6% |
| Other values (8) | 23310 |
home_ownership
Categorical
| Distinct | 5 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 322.2 KiB |
| RENT | |
|---|---|
| MORTGAGE | |
| OWN | |
| OTHER | 134 |
| NONE | 2 |
Length
| Max length | 8 |
|---|---|
| Median length | 4 |
| Mean length | 5.718129109 |
| Min length | 3 |
Characters and Unicode
| Total characters | 235707 |
|---|---|
| Distinct characters | 10 |
| Distinct categories | 1 |
| Distinct scripts | 1 |
| Distinct blocks | 1 |
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | RENT |
|---|---|
| 2nd row | RENT |
| 3rd row | RENT |
| 4th row | RENT |
| 5th row | RENT |
Common Values
| Value | Count | Frequency (%) |
|---|---|---|
| RENT | 19649 | 47.7% |
| MORTGAGE | 18425 | 44.7% |
| OWN | 3011 | 7.3% |
| OTHER | 134 | 0.3% |
| NONE | 2 | < 0.1% |
Words
| Value | Count | Frequency (%) |
| rent | 19649 | |
| mortgage | 18425 | |
| own | 3011 | 7.3% |
| other | 134 | 0.3% |
| none | 2 | < 0.1% |
Most occurring characters
| Value | Count | Frequency (%) |
| E | 38210 | |
| R | 38208 | |
| T | 38208 | |
| G | 36850 | |
| N | 22664 | |
| O | 21572 | |
| M | 18425 | |
| A | 18425 | |
| W | 3011 | 1.3% |
| H | 134 | 0.1% |
Most occurring categories
| Value | Count | Frequency (%) |
| Uppercase Letter | 235707 |
Most frequent character per category
Uppercase Letter
| Value | Count | Frequency (%) |
| E | 38210 | |
| R | 38208 | |
| T | 38208 | |
| G | 36850 | |
| N | 22664 | |
| O | 21572 | |
| M | 18425 | |
| A | 18425 | |
| W | 3011 | 1.3% |
| H | 134 | 0.1% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 235707 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| E | 38210 | |
| R | 38208 | |
| T | 38208 | |
| G | 36850 | |
| N | 22664 | |
| O | 21572 | |
| M | 18425 | |
| A | 18425 | |
| W | 3011 | 1.3% |
| H | 134 | 0.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 235707 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| E | 38210 | |
| R | 38208 | |
| T | 38208 | |
| G | 36850 | |
| N | 22664 | |
| O | 21572 | |
| M | 18425 | |
| A | 18425 | |
| W | 3011 | 1.3% |
| H | 134 | 0.1% |
annual_inc
Numeric
| Distinct | 5365 |
|---|---|
| Distinct (%) | 13.0% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 69771.55031 |
| Minimum | 1896 |
|---|---|
| Maximum | 6000000 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 322.2 KiB |
Quantile statistics
| Minimum | 1896 |
|---|---|
| 5-th percentile | 24000 |
| Q1 | 41000 |
| median | 60000 |
| Q3 | 83400 |
| 95-th percentile | 145000 |
| Maximum | 6000000 |
| Range | 5998104 |
| Interquartile range (IQR) | 42400 |
Descriptive statistics
| Standard deviation | 64520.72619 |
|---|---|
| Coefficient of variation (CV) | 0.9247426194 |
| Kurtosis | 2126.31231 |
| Mean | 69771.55031 |
| Median Absolute Deviation (MAD) | 20000 |
| Skewness | 29.30357761 |
| Sum | 2876053075 |
| Variance | 4162924108 |
| Monotonicity | Not monotonic |
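The very large skewness and kurtosis above are what a handful of extreme incomes (up to 6,000,000 against a median of 60,000) would produce. The sketch below illustrates the effect on a small, hypothetical list of incomes that is not taken from the dataset, and shows that a log transform reduces the skew. It uses the plain moment-based estimator; the report's γ1 may come from a bias-corrected variant, so the formula here is an illustration rather than a reproduction.

```python
import math


def skewness(values):
    """Moment-based skewness g1 = m3 / m2**1.5 (no bias correction)."""
    n = len(values)
    mean = sum(values) / n
    m2 = sum((v - mean) ** 2 for v in values) / n   # second central moment
    m3 = sum((v - mean) ** 3 for v in values) / n   # third central moment
    return m3 / m2 ** 1.5


# Hypothetical incomes, NOT taken from the dataset: a single extreme value
# dominates the third moment.
incomes = [24000, 41000, 60000, 83400, 145000, 6000000]

raw = skewness(incomes)
logged = skewness([math.log(v) for v in incomes])
print(f"skewness of raw values: {raw:.2f}")
print(f"skewness after log:     {logged:.2f}")
```

A log or similar monotone transform is a common way to tame such a tail before using the variable in a model that is sensitive to outliers.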
| Value | Count | Frequency (%) |
| 60000 | 1552 | 3.8% |
| 50000 | 1090 | 2.6% |
| 40000 | 911 | 2.2% |
| 45000 | 874 | 2.1% |
| 75000 | 854 | 2.1% |
| 30000 | 846 | 2.1% |
| 65000 | 826 | 2.0% |
| 70000 | 775 | 1.9% |
| 48000 | 738 | 1.8% |
| 80000 | 706 | 1.7% |
| Other values (5355) | 32049 |
| Value | Count | Frequency (%) |
| 1896 | 1 | < 0.1% |
| 2000 | 1 | < 0.1% |
| 3300 | 1 | < 0.1% |
| 3500 | 1 | < 0.1% |
| 3600 | 1 | < 0.1% |
| 4000 | 1 | < 0.1% |
| 4080 | 1 | < 0.1% |
| 4500 | 1 | < 0.1% |
| 4800 | 2 | |
| 5000 | 3 |
| Value | Count | Frequency (%) |
| 6000000 | 1 | < 0.1% |
| 3900000 | 1 | < 0.1% |
| 2039784 | 1 | < 0.1% |
| 1900000 | 1 | < 0.1% |
| 1782000 | 1 | < 0.1% |
| 1440000 | 2 | |
| 1362000 | 1 | < 0.1% |
| 1250000 | 1 | < 0.1% |
| 1200000 | 4 | |
| 1176000 | 1 | < 0.1% |
verification_status
Categorical
| Distinct | 3 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 322.2 KiB |
| Not Verified | |
|---|---|
| Verified | |
| Source Verified |
Length
| Max length | 15 |
|---|---|
| Median length | 12 |
| Mean length | 11.47361782 |
| Min length | 8 |
Characters and Unicode
| Total characters | 472954 |
|---|---|
| Distinct characters | 13 |
| Distinct categories | 3 |
| Distinct scripts | 2 |
| Distinct blocks | 1 |
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | Verified |
|---|---|
| 2nd row | Source Verified |
| 3rd row | Not Verified |
| 4th row | Source Verified |
| 5th row | Source Verified |
Common Values
| Value | Count | Frequency (%) |
|---|---|---|
| Not Verified | 18132 | 44.0% |
| Verified | 12995 | 31.5% |
| Source Verified | 10094 | 24.5% |
Words
| Value | Count | Frequency (%) |
| verified | 41221 | |
| not | 18132 | |
| source | 10094 | 14.5% |
Most occurring characters
| Value | Count | Frequency (%) |
| e | 92536 | |
| i | 82442 | |
| r | 51315 | |
| V | 41221 | |
| f | 41221 | |
| d | 41221 | |
| o | 28226 | 6.0% |
| (space) | 28226 | 6.0% |
| N | 18132 | 3.8% |
| t | 18132 | 3.8% |
| Other values (3) | 30282 | 6.4% |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 375281 | |
| Uppercase Letter | 69447 | 14.7% |
| Space Separator | 28226 | 6.0% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| e | 92536 | |
| i | 82442 | |
| r | 51315 | |
| f | 41221 | |
| d | 41221 | |
| o | 28226 | 7.5% |
| t | 18132 | 4.8% |
| u | 10094 | 2.7% |
| c | 10094 | 2.7% |
Uppercase Letter
| Value | Count | Frequency (%) |
| V | 41221 | |
| N | 18132 | |
| S | 10094 | 14.5% |
Space Separator
| Value | Count | Frequency (%) |
| (space) | 28226 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 444728 | |
| Common | 28226 | 6.0% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| e | 92536 | |
| i | 82442 | |
| r | 51315 | |
| V | 41221 | |
| f | 41221 | |
| d | 41221 | |
| o | 28226 | 6.3% |
| N | 18132 | 4.1% |
| t | 18132 | 4.1% |
| S | 10094 | 2.3% |
| Other values (2) | 20188 | 4.5% |
Common
| Value | Count | Frequency (%) |
| (space) | 28226 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 472954 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| e | 92536 | |
| i | 82442 | |
| r | 51315 | |
| V | 41221 | |
| f | 41221 | |
| d | 41221 | |
| o | 28226 | 6.0% |
| (space) | 28226 | 6.0% |
| N | 18132 | 3.8% |
| t | 18132 | 3.8% |
| Other values (3) | 30282 | 6.4% |
loan_status
Categorical
| Distinct | 9 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 322.2 KiB |
| Fully Paid | |
|---|---|
| Charged Off | |
| Does not meet the credit policy. Status:Fully Paid | 1895 |
| Does not meet the credit policy. Status:Charged Off | 723 |
| Current | 494 |
| Other values (4) | 32 |
Length
| Max length | 51 |
|---|---|
| Median length | 10 |
| Mean length | 12.65782004 |
| Min length | 7 |
Characters and Unicode
| Total characters | 521768 |
|---|---|
| Distinct characters | 38 |
| Distinct categories | 8 |
| Distinct scripts | 2 |
| Distinct blocks | 1 |
Unique
| Unique | 1 |
|---|---|
| Unique (%) | < 0.1% |
Sample
| 1st row | Fully Paid |
|---|---|
| 2nd row | Charged Off |
| 3rd row | Fully Paid |
| 4th row | Fully Paid |
| 5th row | Current |
Common Values
| Value | Count | Frequency (%) |
|---|---|---|
| Fully Paid | 32676 | 79.3% |
| Charged Off | 5401 | 13.1% |
| Does not meet the credit policy. Status:Fully Paid | 1895 | 4.6% |
| Does not meet the credit policy. Status:Charged Off | 723 | 1.8% |
| Current | 494 | 1.2% |
| In Grace Period | 15 | < 0.1% |
| Late (31-120 days) | 12 | < 0.1% |
| Late (16-30 days) | 4 | < 0.1% |
| Default | 1 | < 0.1% |
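Two of the nine status labels are the plain outcomes prefixed with a credit-policy note. If the prefix is not needed, the labels can be merged with their base status. The sketch below does this with counts copied from the Common Values table above; whether merging is appropriate depends on the analysis, and `base_status` is a hypothetical helper name.

```python
PREFIX = "Does not meet the credit policy. Status:"


def base_status(status: str) -> str:
    """Strip the credit-policy prefix so both variants share one label."""
    if status.startswith(PREFIX):
        return status[len(PREFIX):]
    return status


# Counts copied from the Common Values table above.
counts = {
    "Fully Paid": 32676,
    "Charged Off": 5401,
    "Does not meet the credit policy. Status:Fully Paid": 1895,
    "Does not meet the credit policy. Status:Charged Off": 723,
    "Current": 494,
    "In Grace Period": 15,
    "Late (31-120 days)": 12,
    "Late (16-30 days)": 4,
    "Default": 1,
}

merged = {}
for status, count in counts.items():
    key = base_status(status)
    merged[key] = merged.get(key, 0) + count

for status, count in merged.items():
    print(f"{status}: {count}")
```

The merged totals for the two main outcomes agree with the word counts for "paid" and "off" reported further down in this section.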
Words
| Value | Count | Frequency (%) |
| paid | 34571 | |
| fully | 32676 | |
| off | 6124 | 6.3% |
| charged | 5401 | 5.5% |
| the | 2618 | 2.7% |
| credit | 2618 | 2.7% |
| policy | 2618 | 2.7% |
| does | 2618 | 2.7% |
| not | 2618 | 2.7% |
| meet | 2618 | 2.7% |
| Other values (11) | 3206 | 3.3% |
Most occurring characters
| Value | Count | Frequency (%) |
| l | 71761 | |
| (space) | 56465 | |
| a | 43361 | |
| d | 43344 | |
| i | 39822 | 7.6% |
| u | 37684 | 7.2% |
| y | 37205 | 7.1% |
| P | 34586 | 6.6% |
| F | 34571 | 6.6% |
| e | 19755 | 3.8% |
| Other values (28) | 103214 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 372761 | |
| Uppercase Letter | 87182 | 16.7% |
| Space Separator | 56465 | 10.8% |
| Other Punctuation | 5236 | 1.0% |
| Decimal Number | 76 | < 0.1% |
| Open Punctuation | 16 | < 0.1% |
| Dash Punctuation | 16 | < 0.1% |
| Close Punctuation | 16 | < 0.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| l | 71761 | |
| a | 43361 | |
| d | 43344 | |
| i | 39822 | |
| u | 37684 | |
| y | 37205 | |
| e | 19755 | 5.3% |
| t | 16219 | 4.4% |
| f | 12249 | 3.3% |
| r | 9760 | 2.6% |
| Other values (8) | 41601 |
Uppercase Letter
| Value | Count | Frequency (%) |
| P | 34586 | |
| F | 34571 | |
| C | 6618 | 7.6% |
| O | 6124 | 7.0% |
| D | 2619 | 3.0% |
| S | 2618 | 3.0% |
| L | 16 | < 0.1% |
| I | 15 | < 0.1% |
| G | 15 | < 0.1% |
Decimal Number
| Value | Count | Frequency (%) |
| 1 | 28 | |
| 3 | 16 | |
| 0 | 16 | |
| 2 | 12 | |
| 6 | 4 | 5.3% |
Other Punctuation
| Value | Count | Frequency (%) |
| . | 2618 | |
| : | 2618 |
Space Separator
| Value | Count | Frequency (%) |
| (space) | 56465 |
Open Punctuation
| Value | Count | Frequency (%) |
| ( | 16 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 16 |
Close Punctuation
| Value | Count | Frequency (%) |
| ) | 16 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 459943 | |
| Common | 61825 | 11.8% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| l | 71761 | |
| a | 43361 | |
| d | 43344 | |
| i | 39822 | |
| u | 37684 | |
| y | 37205 | |
| P | 34586 | |
| F | 34571 | |
| e | 19755 | 4.3% |
| t | 16219 | 3.5% |
| Other values (17) | 81635 |
Common
| Value | Count | Frequency (%) |
| (space) | 56465 | |
| . | 2618 | 4.2% |
| : | 2618 | 4.2% |
| 1 | 28 | < 0.1% |
| ( | 16 | < 0.1% |
| 3 | 16 | < 0.1% |
| - | 16 | < 0.1% |
| 0 | 16 | < 0.1% |
| ) | 16 | < 0.1% |
| 2 | 12 | < 0.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 521768 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| l | 71761 | |
| (space) | 56465 | |
| a | 43361 | |
| d | 43344 | |
| i | 39822 | 7.6% |
| u | 37684 | 7.2% |
| y | 37205 | 7.1% |
| P | 34586 | 6.6% |
| F | 34571 | 6.6% |
| e | 19755 | 3.8% |
| Other values (28) | 103214 |
purpose
Categorical
| Distinct | 14 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 322.2 KiB |
| debt_consolidation | |
|---|---|
| credit_card | |
| other | |
| home_improvement | |
| major_purchase | |
| Other values (9) |
Length
| Max length | 18 |
|---|---|
| Median length | 16 |
| Mean length | 13.73071978 |
| Min length | 3 |
Characters and Unicode
| Total characters | 565994 |
|---|---|
| Distinct characters | 22 |
| Distinct categories | 2 |
| Distinct scripts | 2 |
| Distinct blocks | 1 |
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | credit_card |
|---|---|
| 2nd row | car |
| 3rd row | small_business |
| 4th row | other |
| 5th row | other |
Common Values
| Value | Count | Frequency (%) |
|---|---|---|
| debt_consolidation | 19314 | 46.9% |
| credit_card | 5309 | 12.9% |
| other | 4205 | 10.2% |
| home_improvement | 3086 | 7.5% |
| major_purchase | 2231 | 5.4% |
| small_business | 1938 | 4.7% |
| car | 1555 | 3.8% |
| wedding | 989 | 2.4% |
| medical | 724 | 1.8% |
| moving | 597 | 1.4% |
| Other values (4) | 1273 | 3.1% |
Words
| Value | Count | Frequency (%) |
| debt_consolidation | 19314 | |
| credit_card | 5309 | 12.9% |
| other | 4205 | 10.2% |
| home_improvement | 3086 | 7.5% |
| major_purchase | 2231 | 5.4% |
| small_business | 1938 | 4.7% |
| car | 1555 | 3.8% |
| wedding | 989 | 2.4% |
| medical | 724 | 1.8% |
| moving | 597 | 1.4% |
| Other values (4) | 1273 | 3.1% |
Most occurring characters
| Value | Count | Frequency (%) |
| o | 72322 | |
| d | 52348 | |
| i | 52036 | |
| t | 51993 | |
| n | 46199 | |
| e | 45268 | 8.0% |
| c | 35207 | 6.2% |
| a | 34930 | 6.2% |
| _ | 31976 | 5.6% |
| s | 29707 | 5.2% |
| Other values (12) | 114008 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 534018 | |
| Connector Punctuation | 31976 | 5.6% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| o | 72322 | |
| d | 52348 | |
| i | 52036 | |
| t | 51993 | |
| n | 46199 | |
| e | 45268 | |
| c | 35207 | 6.6% |
| a | 34930 | 6.5% |
| s | 29707 | 5.6% |
| l | 24412 | 4.6% |
| Other values (11) | 89596 |
Connector Punctuation
| Value | Count | Frequency (%) |
| _ | 31976 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 534018 | |
| Common | 31976 | 5.6% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| o | 72322 | |
| d | 52348 | |
| i | 52036 | |
| t | 51993 | |
| n | 46199 | |
| e | 45268 | |
| c | 35207 | 6.6% |
| a | 34930 | 6.5% |
| s | 29707 | 5.6% |
| l | 24412 | 4.6% |
| Other values (11) | 89596 |
Common
| Value | Count | Frequency (%) |
| _ | 31976 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 565994 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| o | 72322 | |
| d | 52348 | |
| i | 52036 | |
| t | 51993 | |
| n | 46199 | |
| e | 45268 | 8.0% |
| c | 35207 | 6.2% |
| a | 34930 | 6.2% |
| _ | 31976 | 5.6% |
| s | 29707 | 5.2% |
| Other values (12) | 114008 |
addr_state
Categorical
| Distinct | 50 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 322.2 KiB |
| CA | |
|---|---|
| NY | |
| FL | |
| TX | |
| NJ | 1949 |
| Other values (45) |
Length
| Max length | 2 |
|---|---|
| Median length | 2 |
| Mean length | 2 |
| Min length | 2 |
Characters and Unicode
| Total characters | 82442 |
|---|---|
| Distinct characters | 24 |
| Distinct categories | 1 |
| Distinct scripts | 1 |
| Distinct blocks | 1 |
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | AZ |
|---|---|
| 2nd row | GA |
| 3rd row | IL |
| 4th row | CA |
| 5th row | OR |
Common Values
| Value | Count | Frequency (%) |
| CA | 7228 | 17.5% |
| NY | 3933 | 9.5% |
| FL | 2985 | 7.2% |
| TX | 2850 | 6.9% |
| NJ | 1949 | 4.7% |
| IL | 1631 | 4.0% |
| PA | 1610 | 3.9% |
| VA | 1452 | 3.5% |
| GA | 1451 | 3.5% |
| MA | 1385 | 3.4% |
| Other values (40) | 14747 | 35.8% |
Length
| Value | Count | Frequency (%) |
| ca | 7228 | 17.5% |
| ny | 3933 | 9.5% |
| fl | 2985 | 7.2% |
| tx | 2850 | 6.9% |
| nj | 1949 | 4.7% |
| il | 1631 | 4.0% |
| pa | 1610 | 3.9% |
| va | 1452 | 3.5% |
| ga | 1451 | 3.5% |
| ma | 1385 | 3.4% |
| Other values (40) | 14747 | 35.8% |
Most occurring characters
| Value | Count | Frequency (%) |
| A | 16118 | 19.6% |
| C | 10344 | 12.5% |
| N | 8245 | 10.0% |
| L | 5530 | 6.7% |
| M | 4925 | 6.0% |
| Y | 4368 | 5.3% |
| T | 4087 | 5.0% |
| O | 3620 | 4.4% |
| I | 3299 | 4.0% |
| F | 2985 | 3.6% |
| Other values (14) | 18921 | 23.0% |
Most occurring categories
| Value | Count | Frequency (%) |
| Uppercase Letter | 82442 | 100.0% |
Most frequent character per category
Uppercase Letter
| Value | Count | Frequency (%) |
| A | 16118 | 19.6% |
| C | 10344 | 12.5% |
| N | 8245 | 10.0% |
| L | 5530 | 6.7% |
| M | 4925 | 6.0% |
| Y | 4368 | 5.3% |
| T | 4087 | 5.0% |
| O | 3620 | 4.4% |
| I | 3299 | 4.0% |
| F | 2985 | 3.6% |
| Other values (14) | 18921 | 23.0% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 82442 | 100.0% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| A | 16118 | 19.6% |
| C | 10344 | 12.5% |
| N | 8245 | 10.0% |
| L | 5530 | 6.7% |
| M | 4925 | 6.0% |
| Y | 4368 | 5.3% |
| T | 4087 | 5.0% |
| O | 3620 | 4.4% |
| I | 3299 | 4.0% |
| F | 2985 | 3.6% |
| Other values (14) | 18921 | 23.0% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 82442 | 100.0% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| A | 16118 | 19.6% |
| C | 10344 | 12.5% |
| N | 8245 | 10.0% |
| L | 5530 | 6.7% |
| M | 4925 | 6.0% |
| Y | 4368 | 5.3% |
| T | 4087 | 5.0% |
| O | 3620 | 4.4% |
| I | 3299 | 4.0% |
| F | 2985 | 3.6% |
| Other values (14) | 18921 | 23.0% |
dti
Real number (ℝ≥0)
| Distinct | 2891 |
|---|---|
| Distinct (%) | 7.0% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 13.40131972 |
| Minimum | 0 |
|---|---|
| Maximum | 29.99 |
| Zeros | 187 |
| Zeros (%) | 0.5% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 322.2 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 2.14 |
| Q1 | 8.24 |
| median | 13.5 |
| Q3 | 18.69 |
| 95-th percentile | 23.92 |
| Maximum | 29.99 |
| Range | 29.99 |
| Interquartile range (IQR) | 10.45 |
Descriptive statistics
| Standard deviation | 6.713904199 |
|---|---|
| Coefficient of variation (CV) | 0.5009882863 |
| Kurtosis | -0.8488100863 |
| Mean | 13.40131972 |
| Median Absolute Deviation (MAD) | 5.22 |
| Skewness | -0.03250169309 |
| Sum | 552415.8 |
| Variance | 45.07650959 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 0 | 187 | 0.5% |
| 12 | 51 | 0.1% |
| 18 | 45 | 0.1% |
| 19.2 | 45 | 0.1% |
| 13.2 | 42 | 0.1% |
| 16.8 | 41 | 0.1% |
| 12.48 | 40 | 0.1% |
| 14.29 | 37 | 0.1% |
| 13.5 | 37 | 0.1% |
| 4.8 | 36 | 0.1% |
| Other values (2881) | 40660 | 98.6% |
| Value | Count | Frequency (%) |
| 0 | 187 | 0.5% |
| 0.01 | 3 | < 0.1% |
| 0.02 | 5 | < 0.1% |
| 0.03 | 2 | < 0.1% |
| 0.04 | 3 | < 0.1% |
| 0.05 | 1 | < 0.1% |
| 0.06 | 1 | < 0.1% |
| 0.07 | 5 | < 0.1% |
| 0.08 | 5 | < 0.1% |
| 0.09 | 4 | < 0.1% |
| Value | Count | Frequency (%) |
| 29.99 | 1 | < 0.1% |
| 29.96 | 1 | < 0.1% |
| 29.95 | 2 | < 0.1% |
| 29.93 | 3 | < 0.1% |
| 29.92 | 2 | < 0.1% |
| 29.9 | 1 | < 0.1% |
| 29.89 | 1 | < 0.1% |
| 29.88 | 1 | < 0.1% |
| 29.86 | 1 | < 0.1% |
| 29.85 | 1 | < 0.1% |
delinq_2yrs
Real number (ℝ≥0)
| Distinct | 11 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 0.1523980495 |
| Minimum | 0 |
|---|---|
| Maximum | 11 |
| Zeros | 36619 |
| Zeros (%) | 88.8% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 322.2 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 0 |
| median | 0 |
| Q3 | 0 |
| 95-th percentile | 1 |
| Maximum | 11 |
| Range | 11 |
| Interquartile range (IQR) | 0 |
Descriptive statistics
| Standard deviation | 0.5090790329 |
|---|---|
| Coefficient of variation (CV) | 3.340456354 |
| Kurtosis | 43.60868145 |
| Mean | 0.1523980495 |
| Median Absolute Deviation (MAD) | 0 |
| Skewness | 5.196366964 |
| Sum | 6282 |
| Variance | 0.2591614617 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 0 | 36619 | 88.8% |
| 1 | 3499 | 8.5% |
| 2 | 746 | 1.8% |
| 3 | 238 | 0.6% |
| 4 | 68 | 0.2% |
| 5 | 26 | 0.1% |
| 6 | 13 | < 0.1% |
| 7 | 6 | < 0.1% |
| 8 | 3 | < 0.1% |
| 11 | 2 | < 0.1% |
| Value | Count | Frequency (%) |
| 0 | 36619 | 88.8% |
| 1 | 3499 | 8.5% |
| 2 | 746 | 1.8% |
| 3 | 238 | 0.6% |
| 4 | 68 | 0.2% |
| 5 | 26 | 0.1% |
| 6 | 13 | < 0.1% |
| 7 | 6 | < 0.1% |
| 8 | 3 | < 0.1% |
| 9 | 1 | < 0.1% |
| Value | Count | Frequency (%) |
| 11 | 2 | < 0.1% |
| 9 | 1 | < 0.1% |
| 8 | 3 | < 0.1% |
| 7 | 6 | < 0.1% |
| 6 | 13 | < 0.1% |
| 5 | 26 | 0.1% |
| 4 | 68 | 0.2% |
| 3 | 238 | 0.6% |
| 2 | 746 | 1.8% |
| 1 | 3499 | 8.5% |
earliest_cr_line
Categorical
| Distinct | 519 |
|---|---|
| Distinct (%) | 1.3% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 322.2 KiB |
| Oct-1999 | 387 |
|---|---|
| Nov-1998 | 385 |
| Oct-2000 | 359 |
| Dec-1998 | 358 |
| Nov-2000 | 337 |
| Other values (514) | 39395 |
Length
| Max length | 8 |
|---|---|
| Median length | 8 |
| Mean length | 8 |
| Min length | 8 |
Characters and Unicode
| Total characters | 329768 |
|---|---|
| Distinct characters | 33 |
| Distinct categories | 4 |
| Distinct scripts | 2 |
| Distinct blocks | 1 |
Unique
| Unique | 34 |
|---|---|
| Unique (%) | 0.1% |
Sample
| 1st row | Jan-1985 |
|---|---|
| 2nd row | Apr-1999 |
| 3rd row | Nov-2001 |
| 4th row | Feb-1996 |
| 5th row | Jan-1996 |
Common Values
| Value | Count | Frequency (%) |
| Oct-1999 | 387 | 0.9% |
| Nov-1998 | 385 | 0.9% |
| Oct-2000 | 359 | 0.9% |
| Dec-1998 | 358 | 0.9% |
| Nov-2000 | 337 | 0.8% |
| Dec-1997 | 337 | 0.8% |
| Nov-1999 | 329 | 0.8% |
| Oct-1998 | 323 | 0.8% |
| Sep-2000 | 317 | 0.8% |
| Nov-1997 | 316 | 0.8% |
| Other values (509) | 37773 | 91.6% |
Length
| Value | Count | Frequency (%) |
| oct-1999 | 387 | 0.9% |
| nov-1998 | 385 | 0.9% |
| oct-2000 | 359 | 0.9% |
| dec-1998 | 358 | 0.9% |
| nov-2000 | 337 | 0.8% |
| dec-1997 | 337 | 0.8% |
| nov-1999 | 329 | 0.8% |
| oct-1998 | 323 | 0.8% |
| sep-2000 | 317 | 0.8% |
| nov-1997 | 316 | 0.8% |
| Other values (509) | 37773 | 91.6% |
Most occurring characters
| Value | Count | Frequency (%) |
| 9 | 49971 | 15.2% |
| - | 41221 | 12.5% |
| 0 | 35789 | 10.9% |
| 1 | 29519 | 9.0% |
| 2 | 18976 | 5.8% |
| e | 10881 | 3.3% |
| J | 9805 | 3.0% |
| u | 9667 | 2.9% |
| a | 9451 | 2.9% |
| 8 | 8664 | 2.6% |
| Other values (23) | 105824 | 32.1% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 164884 | 50.0% |
| Lowercase Letter | 82442 | 25.0% |
| Uppercase Letter | 41221 | 12.5% |
| Dash Punctuation | 41221 | 12.5% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| e | 10881 | 13.2% |
| u | 9667 | 11.7% |
| a | 9451 | 11.5% |
| c | 8448 | 10.2% |
| n | 6627 | 8.0% |
| p | 6584 | 8.0% |
| r | 5730 | 7.0% |
| t | 4268 | 5.2% |
| o | 4090 | 5.0% |
| v | 4090 | 5.0% |
| Other values (4) | 12606 | 15.3% |
Decimal Number
| Value | Count | Frequency (%) |
| 9 | 49971 | 30.3% |
| 0 | 35789 | 21.7% |
| 1 | 29519 | 17.9% |
| 2 | 18976 | 11.5% |
| 8 | 8664 | 5.3% |
| 7 | 4892 | 3.0% |
| 4 | 4432 | 2.7% |
| 5 | 4376 | 2.7% |
| 6 | 4371 | 2.7% |
| 3 | 3894 | 2.4% |
Uppercase Letter
| Value | Count | Frequency (%) |
| J | 9805 | 23.8% |
| A | 6301 | 15.3% |
| M | 5876 | 14.3% |
| O | 4268 | 10.4% |
| D | 4180 | 10.1% |
| N | 4090 | 9.9% |
| S | 3720 | 9.0% |
| F | 2981 | 7.2% |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 41221 | 100.0% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 206105 | 62.5% |
| Latin | 123663 | 37.5% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| e | 10881 | 8.8% |
| J | 9805 | 7.9% |
| u | 9667 | 7.8% |
| a | 9451 | 7.6% |
| c | 8448 | 6.8% |
| n | 6627 | 5.4% |
| p | 6584 | 5.3% |
| A | 6301 | 5.1% |
| M | 5876 | 4.8% |
| r | 5730 | 4.6% |
| Other values (12) | 44293 | 35.8% |
Common
| Value | Count | Frequency (%) |
| 9 | 49971 | 24.2% |
| - | 41221 | 20.0% |
| 0 | 35789 | 17.4% |
| 1 | 29519 | 14.3% |
| 2 | 18976 | 9.2% |
| 8 | 8664 | 4.2% |
| 7 | 4892 | 2.4% |
| 4 | 4432 | 2.2% |
| 5 | 4376 | 2.1% |
| 6 | 4371 | 2.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 329768 | 100.0% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 9 | 49971 | 15.2% |
| - | 41221 | 12.5% |
| 0 | 35789 | 10.9% |
| 1 | 29519 | 9.0% |
| 2 | 18976 | 5.8% |
| e | 10881 | 3.3% |
| J | 9805 | 3.0% |
| u | 9667 | 2.9% |
| a | 9451 | 2.9% |
| 8 | 8664 | 2.6% |
| Other values (23) | 105824 | 32.1% |
inq_last_6mths
Real number (ℝ≥0)
| Distinct | 28 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 1.081317775 |
| Minimum | 0 |
|---|---|
| Maximum | 33 |
| Zeros | 19037 |
| Zeros (%) | 46.2% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 322.2 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 0 |
| median | 1 |
| Q3 | 2 |
| 95-th percentile | 4 |
| Maximum | 33 |
| Range | 33 |
| Interquartile range (IQR) | 2 |
Descriptive statistics
| Standard deviation | 1.521441632 |
|---|---|
| Coefficient of variation (CV) | 1.407025453 |
| Kurtosis | 31.41822687 |
| Mean | 1.081317775 |
| Median Absolute Deviation (MAD) | 1 |
| Skewness | 3.446584413 |
| Sum | 44573 |
| Variance | 2.314784639 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 0 | 19037 | 46.2% |
| 1 | 10909 | 26.5% |
| 2 | 5824 | 14.1% |
| 3 | 3094 | 7.5% |
| 4 | 1025 | 2.5% |
| 5 | 588 | 1.4% |
| 6 | 325 | 0.8% |
| 7 | 173 | 0.4% |
| 8 | 112 | 0.3% |
| 9 | 46 | 0.1% |
| Other values (18) | 88 | 0.2% |
| Value | Count | Frequency (%) |
| 0 | 19037 | 46.2% |
| 1 | 10909 | 26.5% |
| 2 | 5824 | 14.1% |
| 3 | 3094 | 7.5% |
| 4 | 1025 | 2.5% |
| 5 | 588 | 1.4% |
| 6 | 325 | 0.8% |
| 7 | 173 | 0.4% |
| 8 | 112 | 0.3% |
| 9 | 46 | 0.1% |
| Value | Count | Frequency (%) |
| 33 | 1 | < 0.1% |
| 32 | 1 | < 0.1% |
| 31 | 1 | < 0.1% |
| 28 | 1 | < 0.1% |
| 27 | 1 | < 0.1% |
| 25 | 1 | < 0.1% |
| 24 | 2 | < 0.1% |
| 20 | 1 | < 0.1% |
| 19 | 2 | < 0.1% |
| 18 | 3 | < 0.1% |
open_acc
Real number (ℝ≥0)
| Distinct | 44 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 9.37323209 |
| Minimum | 1 |
|---|---|
| Maximum | 47 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 322.2 KiB |
Quantile statistics
| Minimum | 1 |
|---|---|
| 5-th percentile | 3 |
| Q1 | 6 |
| median | 9 |
| Q3 | 12 |
| 95-th percentile | 18 |
| Maximum | 47 |
| Range | 46 |
| Interquartile range (IQR) | 6 |
Descriptive statistics
| Standard deviation | 4.487523337 |
|---|---|
| Coefficient of variation (CV) | 0.478759439 |
| Kurtosis | 1.96594772 |
| Mean | 9.37323209 |
| Median Absolute Deviation (MAD) | 3 |
| Skewness | 1.045304521 |
| Sum | 386374 |
| Variance | 20.1378657 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 7 | 4146 | 10.1% |
| 8 | 4063 | 9.9% |
| 6 | 4035 | 9.8% |
| 9 | 3823 | 9.3% |
| 10 | 3301 | 8.0% |
| 5 | 3231 | 7.8% |
| 11 | 2877 | 7.0% |
| 4 | 2405 | 5.8% |
| 12 | 2350 | 5.7% |
| 13 | 2005 | 4.9% |
| Other values (34) | 8985 | 21.8% |
| Value | Count | Frequency (%) |
| 1 | 34 | 0.1% |
| 2 | 629 | 1.5% |
| 3 | 1521 | 3.7% |
| 4 | 2405 | 5.8% |
| 5 | 3231 | 7.8% |
| 6 | 4035 | 9.8% |
| 7 | 4146 | 10.1% |
| 8 | 4063 | 9.9% |
| 9 | 3823 | 9.3% |
| 10 | 3301 | 8.0% |
| Value | Count | Frequency (%) |
| 47 | 1 | < 0.1% |
| 46 | 1 | < 0.1% |
| 44 | 1 | < 0.1% |
| 42 | 1 | < 0.1% |
| 41 | 1 | < 0.1% |
| 39 | 1 | < 0.1% |
| 38 | 2 | < 0.1% |
| 37 | 1 | < 0.1% |
| 36 | 2 | < 0.1% |
| 35 | 4 | < 0.1% |
pub_rec
Real number (ℝ≥0)
| Distinct | 6 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 0.05645180854 |
| Minimum | 0 |
|---|---|
| Maximum | 5 |
| Zeros | 38988 |
| Zeros (%) | 94.6% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 322.2 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 0 |
| median | 0 |
| Q3 | 0 |
| 95-th percentile | 1 |
| Maximum | 5 |
| Range | 5 |
| Interquartile range (IQR) | 0 |
Descriptive statistics
| Standard deviation | 0.2427821039 |
|---|---|
| Coefficient of variation (CV) | 4.30069665 |
| Kurtosis | 28.43678714 |
| Mean | 0.05645180854 |
| Median Absolute Deviation (MAD) | 0 |
| Skewness | 4.720084777 |
| Sum | 2327 |
| Variance | 0.05894314996 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 0 | 38988 | 94.6% |
| 1 | 2157 | 5.2% |
| 2 | 62 | 0.2% |
| 3 | 11 | < 0.1% |
| 4 | 2 | < 0.1% |
| 5 | 1 | < 0.1% |
| Value | Count | Frequency (%) |
| 0 | 38988 | 94.6% |
| 1 | 2157 | 5.2% |
| 2 | 62 | 0.2% |
| 3 | 11 | < 0.1% |
| 4 | 2 | < 0.1% |
| 5 | 1 | < 0.1% |
| Value | Count | Frequency (%) |
| 5 | 1 | < 0.1% |
| 4 | 2 | < 0.1% |
| 3 | 11 | < 0.1% |
| 2 | 62 | 0.2% |
| 1 | 2157 | 5.2% |
| 0 | 38988 | 94.6% |
revol_bal
Real number (ℝ≥0)
| Distinct | 22409 |
|---|---|
| Distinct (%) | 54.4% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 14404.05458 |
| Minimum | 0 |
|---|---|
| Maximum | 1207359 |
| Zeros | 997 |
| Zeros (%) | 2.4% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 322.2 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 324 |
| Q1 | 3712 |
| median | 8943 |
| Q3 | 17392 |
| 95-th percentile | 44711 |
| Maximum | 1207359 |
| Range | 1207359 |
| Interquartile range (IQR) | 13680 |
Descriptive statistics
| Standard deviation | 22088.47451 |
|---|---|
| Coefficient of variation (CV) | 1.533490059 |
| Kurtosis | 351.6086808 |
| Mean | 14404.05458 |
| Median Absolute Deviation (MAD) | 6129 |
| Skewness | 11.12198398 |
| Sum | 593749534 |
| Variance | 487900706.1 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 0 | 997 | 2.4% |
| 255 | 14 | < 0.1% |
| 298 | 14 | < 0.1% |
| 1 | 12 | < 0.1% |
| 682 | 11 | < 0.1% |
| 400 | 10 | < 0.1% |
| 39 | 10 | < 0.1% |
| 1763 | 9 | < 0.1% |
| 182 | 9 | < 0.1% |
| 1159 | 9 | < 0.1% |
| Other values (22399) | 40126 | 97.3% |
| Value | Count | Frequency (%) |
| 0 | 997 | 2.4% |
| 1 | 12 | < 0.1% |
| 2 | 6 | < 0.1% |
| 3 | 7 | < 0.1% |
| 4 | 3 | < 0.1% |
| 5 | 7 | < 0.1% |
| 6 | 9 | < 0.1% |
| 7 | 5 | < 0.1% |
| 8 | 5 | < 0.1% |
| 9 | 7 | < 0.1% |
| Value | Count | Frequency (%) |
| 1207359 | 1 | < 0.1% |
| 952013 | 1 | < 0.1% |
| 602519 | 1 | < 0.1% |
| 508961 | 1 | < 0.1% |
| 487589 | 1 | < 0.1% |
| 465731 | 1 | < 0.1% |
| 423189 | 1 | < 0.1% |
| 407794 | 1 | < 0.1% |
| 401941 | 1 | < 0.1% |
| 394107 | 1 | < 0.1% |
revol_util
Categorical
| Distinct | 1116 |
|---|---|
| Distinct (%) | 2.7% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 322.2 KiB |
| 0% | 1031 |
|---|---|
| 0.2% | 63 |
| 40.7% | 63 |
| 63% | 62 |
| 66.6% | 61 |
| Other values (1111) | 39941 |
Length
| Max length | 6 |
|---|---|
| Median length | 5 |
| Mean length | 4.647970695 |
| Min length | 2 |
Characters and Unicode
| Total characters | 191594 |
|---|---|
| Distinct characters | 12 |
| Distinct categories | 2 |
| Distinct scripts | 1 |
| Distinct blocks | 1 |
Unique
| Unique | 112 |
|---|---|
| Unique (%) | 0.3% |
Sample
| 1st row | 83.7% |
|---|---|
| 2nd row | 9.4% |
| 3rd row | 98.5% |
| 4th row | 21% |
| 5th row | 53.9% |
Common Values
| Value | Count | Frequency (%) |
| 0% | 1031 | 2.5% |
| 0.2% | 63 | 0.2% |
| 40.7% | 63 | 0.2% |
| 63% | 62 | 0.2% |
| 66.6% | 61 | 0.1% |
| 70.4% | 60 | 0.1% |
| 0.1% | 59 | 0.1% |
| 37.6% | 59 | 0.1% |
| 78.7% | 58 | 0.1% |
| 66.7% | 58 | 0.1% |
| Other values (1106) | 39647 | 96.2% |
Length
| Value | Count | Frequency (%) |
| 0 | 1031 | 2.5% |
| 0.2 | 63 | 0.2% |
| 40.7 | 63 | 0.2% |
| 63 | 62 | 0.2% |
| 66.6 | 61 | 0.1% |
| 70.4 | 60 | 0.1% |
| 0.1 | 59 | 0.1% |
| 37.6 | 59 | 0.1% |
| 66.7 | 58 | 0.1% |
| 78.7 | 58 | 0.1% |
| Other values (1106) | 39647 | 96.2% |
Most occurring characters
| Value | Count | Frequency (%) |
| % | 41221 | 21.5% |
| . | 36179 | 18.9% |
| 4 | 12533 | 6.5% |
| 5 | 12529 | 6.5% |
| 7 | 12496 | 6.5% |
| 6 | 12484 | 6.5% |
| 3 | 12300 | 6.4% |
| 8 | 11983 | 6.3% |
| 2 | 11955 | 6.2% |
| 1 | 11465 | 6.0% |
| Other values (2) | 16449 | 8.6% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 114194 | 59.6% |
| Other Punctuation | 77400 | 40.4% |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 4 | 12533 | 11.0% |
| 5 | 12529 | 11.0% |
| 7 | 12496 | 10.9% |
| 6 | 12484 | 10.9% |
| 3 | 12300 | 10.8% |
| 8 | 11983 | 10.5% |
| 2 | 11955 | 10.5% |
| 1 | 11465 | 10.0% |
| 9 | 11282 | 9.9% |
| 0 | 5167 | 4.5% |
Other Punctuation
| Value | Count | Frequency (%) |
| % | 41221 | 53.3% |
| . | 36179 | 46.7% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 191594 | 100.0% |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| % | 41221 | 21.5% |
| . | 36179 | 18.9% |
| 4 | 12533 | 6.5% |
| 5 | 12529 | 6.5% |
| 7 | 12496 | 6.5% |
| 6 | 12484 | 6.5% |
| 3 | 12300 | 6.4% |
| 8 | 11983 | 6.3% |
| 2 | 11955 | 6.2% |
| 1 | 11465 | 6.0% |
| Other values (2) | 16449 | 8.6% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 191594 | 100.0% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| % | 41221 | 21.5% |
| . | 36179 | 18.9% |
| 4 | 12533 | 6.5% |
| 5 | 12529 | 6.5% |
| 7 | 12496 | 6.5% |
| 6 | 12484 | 6.5% |
| 3 | 12300 | 6.4% |
| 8 | 11983 | 6.3% |
| 2 | 11955 | 6.2% |
| 1 | 11465 | 6.0% |
| Other values (2) | 16449 | 8.6% |
total_acc
Real number (ℝ≥0)
| Distinct | 83 |
|---|---|
| Distinct (%) | 0.2% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 22.18177628 |
| Minimum | 1 |
|---|---|
| Maximum | 90 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 322.2 KiB |
Quantile statistics
| Minimum | 1 |
|---|---|
| 5-th percentile | 7 |
| Q1 | 13 |
| median | 20 |
| Q3 | 29 |
| 95-th percentile | 44 |
| Maximum | 90 |
| Range | 89 |
| Interquartile range (IQR) | 16 |
Descriptive statistics
| Standard deviation | 11.57319276 |
|---|---|
| Coefficient of variation (CV) | 0.5217432821 |
| Kurtosis | 0.6546840675 |
| Mean | 22.18177628 |
| Median Absolute Deviation (MAD) | 8 |
| Skewness | 0.8198893335 |
| Sum | 914355 |
| Variance | 133.9387906 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 15 | 1510 | 3.7% |
| 16 | 1505 | 3.7% |
| 17 | 1496 | 3.6% |
| 14 | 1490 | 3.6% |
| 20 | 1468 | 3.6% |
| 18 | 1455 | 3.5% |
| 21 | 1433 | 3.5% |
| 13 | 1431 | 3.5% |
| 19 | 1373 | 3.3% |
| 12 | 1372 | 3.3% |
| Other values (73) | 26688 | 64.7% |
| Value | Count | Frequency (%) |
| 1 | 20 | < 0.1% |
| 2 | 36 | 0.1% |
| 3 | 217 | 0.5% |
| 4 | 457 | 1.1% |
| 5 | 591 | 1.4% |
| 6 | 719 | 1.7% |
| 7 | 863 | 2.1% |
| 8 | 1036 | 2.5% |
| 9 | 1099 | 2.7% |
| 10 | 1213 | 2.9% |
| Value | Count | Frequency (%) |
| 90 | 1 | < 0.1% |
| 87 | 1 | < 0.1% |
| 81 | 1 | < 0.1% |
| 80 | 1 | < 0.1% |
| 79 | 2 | < 0.1% |
| 78 | 1 | < 0.1% |
| 77 | 1 | < 0.1% |
| 76 | 1 | < 0.1% |
| 75 | 2 | < 0.1% |
| 74 | 1 | < 0.1% |
last_credit_pull_d
Categorical
| Distinct | 108 |
|---|---|
| Distinct (%) | 0.3% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 322.2 KiB |
| Sep-2016 | 15784 |
|---|---|
| Mar-2016 | 836 |
| Aug-2016 | 748 |
| Feb-2013 | 688 |
| Apr-2016 | 685 |
| Other values (103) | 22480 |
Length
| Max length | 8 |
|---|---|
| Median length | 8 |
| Mean length | 8 |
| Min length | 8 |
Characters and Unicode
| Total characters | 329768 |
|---|---|
| Distinct characters | 33 |
| Distinct categories | 4 |
| Distinct scripts | 2 |
| Distinct blocks | 1 |
Unique
| Unique | 3 |
|---|---|
| Unique (%) | < 0.1% |
Sample
| 1st row | Sep-2016 |
|---|---|
| 2nd row | Sep-2016 |
| 3rd row | Sep-2016 |
| 4th row | Apr-2016 |
| 5th row | Sep-2016 |
Common Values
| Value | Count | Frequency (%) |
| Sep-2016 | 15784 | 38.3% |
| Mar-2016 | 836 | 2.0% |
| Aug-2016 | 748 | 1.8% |
| Feb-2013 | 688 | 1.7% |
| Apr-2016 | 685 | 1.7% |
| Jul-2016 | 603 | 1.5% |
| Feb-2016 | 594 | 1.4% |
| Jun-2016 | 521 | 1.3% |
| Jan-2016 | 514 | 1.2% |
| May-2016 | 499 | 1.2% |
| Other values (98) | 19749 | 47.9% |
Length
| Value | Count | Frequency (%) |
| sep-2016 | 15784 | 38.3% |
| mar-2016 | 836 | 2.0% |
| aug-2016 | 748 | 1.8% |
| feb-2013 | 688 | 1.7% |
| apr-2016 | 685 | 1.7% |
| jul-2016 | 603 | 1.5% |
| feb-2016 | 594 | 1.4% |
| jun-2016 | 521 | 1.3% |
| jan-2016 | 514 | 1.2% |
| may-2016 | 499 | 1.2% |
| Other values (98) | 19749 | 47.9% |
Most occurring characters
| Value | Count | Frequency (%) |
| 2 | 44554 | 13.5% |
| 1 | 42943 | 13.0% |
| 0 | 42422 | 12.9% |
| - | 41221 | 12.5% |
| e | 22111 | 6.7% |
| 6 | 20784 | 6.3% |
| p | 19803 | 6.0% |
| S | 17574 | 5.3% |
| u | 6780 | 2.1% |
| a | 6558 | 2.0% |
| Other values (23) | 65018 | 19.7% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 164884 | 50.0% |
| Lowercase Letter | 82442 | 25.0% |
| Uppercase Letter | 41221 | 12.5% |
| Dash Punctuation | 41221 | 12.5% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| e | 22111 | 26.8% |
| p | 19803 | 24.0% |
| u | 6780 | 8.2% |
| a | 6558 | 8.0% |
| r | 4949 | 6.0% |
| n | 3729 | 4.5% |
| c | 3694 | 4.5% |
| b | 2589 | 3.1% |
| g | 2462 | 3.0% |
| l | 2326 | 2.8% |
| Other values (4) | 7441 | 9.0% |
Decimal Number
| Value | Count | Frequency (%) |
| 2 | 44554 | 27.0% |
| 1 | 42943 | 26.0% |
| 0 | 42422 | 25.7% |
| 6 | 20784 | 12.6% |
| 4 | 5144 | 3.1% |
| 5 | 4521 | 2.7% |
| 3 | 4166 | 2.5% |
| 9 | 264 | 0.2% |
| 8 | 59 | < 0.1% |
| 7 | 27 | < 0.1% |
Uppercase Letter
| Value | Count | Frequency (%) |
| S | 17574 | 42.6% |
| J | 6055 | 14.7% |
| M | 4821 | 11.7% |
| A | 4691 | 11.4% |
| F | 2589 | 6.3% |
| D | 1948 | 4.7% |
| N | 1797 | 4.4% |
| O | 1746 | 4.2% |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 41221 | 100.0% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 206105 | 62.5% |
| Latin | 123663 | 37.5% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| e | 22111 | 17.9% |
| p | 19803 | 16.0% |
| S | 17574 | 14.2% |
| u | 6780 | 5.5% |
| a | 6558 | 5.3% |
| J | 6055 | 4.9% |
| r | 4949 | 4.0% |
| M | 4821 | 3.9% |
| A | 4691 | 3.8% |
| n | 3729 | 3.0% |
| Other values (12) | 26592 | 21.5% |
Common
| Value | Count | Frequency (%) |
| 2 | 44554 | 21.6% |
| 1 | 42943 | 20.8% |
| 0 | 42422 | 20.6% |
| - | 41221 | 20.0% |
| 6 | 20784 | 10.1% |
| 4 | 5144 | 2.5% |
| 5 | 4521 | 2.2% |
| 3 | 4166 | 2.0% |
| 9 | 264 | 0.1% |
| 8 | 59 | < 0.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 329768 | 100.0% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 2 | 44554 | 13.5% |
| 1 | 42943 | 13.0% |
| 0 | 42422 | 12.9% |
| - | 41221 | 12.5% |
| e | 22111 | 6.7% |
| 6 | 20784 | 6.3% |
| p | 19803 | 6.0% |
| S | 17574 | 5.3% |
| u | 6780 | 2.1% |
| a | 6558 | 2.0% |
| Other values (23) | 65018 | 19.7% |
fico_average
Real number (ℝ≥0)
| Distinct | 44 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 714.7798937 |
| Minimum | 612 |
|---|---|
| Maximum | 827 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 322.2 KiB |
Quantile statistics
| Minimum | 612 |
|---|---|
| 5-th percentile | 667 |
| Q1 | 687 |
| median | 712 |
| Q3 | 742 |
| 95-th percentile | 782 |
| Maximum | 827 |
| Range | 215 |
| Interquartile range (IQR) | 55 |
Descriptive statistics
| Standard deviation | 35.97754636 |
|---|---|
| Coefficient of variation (CV) | 0.05033374144 |
| Kurtosis | -0.49297322 |
| Mean | 714.7798937 |
| Median Absolute Deviation (MAD) | 25 |
| Skewness | 0.4671441834 |
| Sum | 29463942 |
| Variance | 1294.383842 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 687 | 2246 | 5.4% |
| 702 | 2208 | 5.4% |
| 682 | 2162 | 5.2% |
| 697 | 2151 | 5.2% |
| 692 | 2136 | 5.2% |
| 677 | 1951 | 4.7% |
| 707 | 1902 | 4.6% |
| 722 | 1888 | 4.6% |
| 727 | 1842 | 4.5% |
| 717 | 1842 | 4.5% |
| Other values (34) | 20893 | 50.7% |
| Value | Count | Frequency (%) |
| 612 | 1 | < 0.1% |
| 617 | 1 | < 0.1% |
| 622 | 1 | < 0.1% |
| 627 | 1 | < 0.1% |
| 632 | 4 | < 0.1% |
| 637 | 3 | < 0.1% |
| 642 | 94 | 0.2% |
| 647 | 105 | 0.3% |
| 652 | 125 | 0.3% |
| 657 | 124 | 0.3% |
| Value | Count | Frequency (%) |
| 827 | 2 | < 0.1% |
| 822 | 17 | < 0.1% |
| 817 | 24 | 0.1% |
| 812 | 114 | 0.3% |
| 807 | 177 | 0.4% |
| 802 | 232 | 0.6% |
| 797 | 321 | 0.8% |
| 792 | 403 | 1.0% |
| 787 | 377 | 0.9% |
| 782 | 539 | 1.3% |
Pearson's r
Pearson's correlation coefficient (r) is a measure of linear correlation between two variables. Its value lies between -1 and +1, with -1 indicating total negative linear correlation, 0 indicating no linear correlation, and +1 indicating total positive linear correlation. Furthermore, r is invariant under separate changes in location and scale of the two variables, implying that for a linear function the angle to the x-axis does not affect r. To calculate r for two variables X and Y, one divides the covariance of X and Y by the product of their standard deviations.
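The calculation described above can be sketched in plain Python. This is a minimal illustration, not the code the report generator itself runs; the function name `pearson_r` is hypothetical, and the sample values are the `loan_amnt` and `installment` entries from the first five rows of this report.

```python
from math import sqrt


def pearson_r(xs, ys):
    """Covariance of xs and ys divided by the product of their standard deviations."""
    if len(xs) != len(ys) or len(xs) < 2:
        raise ValueError("need two equally long sequences with at least 2 points")
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    # The 1/n factors of the covariance and of the two standard deviations
    # cancel, so plain sums of deviations are sufficient.
    cov = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    ss_x = sum((x - mean_x) ** 2 for x in xs)
    ss_y = sum((y - mean_y) ** 2 for y in ys)
    if ss_x == 0 or ss_y == 0:
        raise ValueError("r is undefined when a variable is constant")
    return cov / sqrt(ss_x * ss_y)


# loan_amnt and installment from the first five rows of this report
loan_amnt = [5000.0, 2500.0, 2400.0, 10000.0, 3000.0]
installment = [162.87, 59.83, 84.33, 339.31, 67.79]

r = pearson_r(loan_amnt, installment)
# Shifting and rescaling one variable (loan_amnt in thousands, plus 1) leaves r unchanged.
r_rescaled = pearson_r([x / 1000 + 1 for x in loan_amnt], installment)
print(round(r, 3), round(r_rescaled, 3))
```

Five rows are far too few to estimate a correlation reliably; the sample only shows the mechanics and the invariance under changes of location and scale.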
Spearman's ρ
Spearman's rank correlation coefficient (ρ) is a measure of monotonic correlation between two variables, and is therefore better at catching nonlinear monotonic correlations than Pearson's r. Its value lies between -1 and +1, with -1 indicating total negative monotonic correlation, 0 indicating no monotonic correlation, and +1 indicating total positive monotonic correlation. To calculate ρ for two variables X and Y, one divides the covariance of the rank variables of X and Y by the product of their standard deviations.
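As a rough sketch of that recipe, the snippet below ranks both variables and then applies the Pearson formula to the ranks. The helper names `average_ranks`, `pearson_r`, and `spearman_rho` are hypothetical, and giving tied values the average of their ranks is an assumption, although it is the usual convention.

```python
from math import sqrt


def average_ranks(values):
    """1-based ranks; tied values share the average of the ranks they occupy."""
    order = sorted(range(len(values)), key=lambda i: values[i])
    ranks = [0.0] * len(values)
    i = 0
    while i < len(order):
        j = i
        # Extend j to the end of the run of equal values.
        while j + 1 < len(order) and values[order[j + 1]] == values[order[i]]:
            j += 1
        shared = (i + j) / 2 + 1
        for k in range(i, j + 1):
            ranks[order[k]] = shared
        i = j + 1
    return ranks


def pearson_r(xs, ys):
    """Covariance divided by the product of the standard deviations."""
    n = len(xs)
    mean_x, mean_y = sum(xs) / n, sum(ys) / n
    cov = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    ss_x = sum((x - mean_x) ** 2 for x in xs)
    ss_y = sum((y - mean_y) ** 2 for y in ys)
    return cov / sqrt(ss_x * ss_y)


def spearman_rho(xs, ys):
    """Pearson correlation of the rank variables of xs and ys."""
    return pearson_r(average_ranks(xs), average_ranks(ys))


# A monotonic but nonlinear relationship: y = x ** 3
xs = [1, 2, 3, 4, 5]
cubes = [x ** 3 for x in xs]
print(spearman_rho(xs, cubes), pearson_r(xs, cubes))
```

On the cubic example the rank correlation is exactly 1 while Pearson's r stays below 1, which is the difference between monotonic and linear association that the paragraph describes.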
Kendall's τ
Similarly to Spearman's rank correlation coefficient, the Kendall rank correlation coefficient (τ) measures ordinal association between two variables. Its value lies between -1 and +1, with -1 indicating total negative correlation, 0 indicating no correlation, and +1 indicating total positive correlation. To calculate τ for two variables X and Y, one determines the number of concordant and discordant pairs of observations. τ is given by the number of concordant pairs minus the number of discordant pairs, divided by the total number of pairs.
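The pair-counting definition above translates almost directly into code. The sketch below implements exactly that formula, often called tau-a; the function name `kendall_tau_a` is hypothetical, and treating tied pairs as neither concordant nor discordant is an assumption. Library implementations may use a tie-adjusted variant instead, so results can differ when the data contain ties.

```python
from itertools import combinations


def kendall_tau_a(xs, ys):
    """(concordant pairs - discordant pairs) / total number of pairs."""
    if len(xs) != len(ys) or len(xs) < 2:
        raise ValueError("need two equally long sequences with at least 2 points")
    concordant = discordant = 0
    for (x1, y1), (x2, y2) in combinations(zip(xs, ys), 2):
        direction = (x1 - x2) * (y1 - y2)
        if direction > 0:
            concordant += 1   # both variables move in the same direction
        elif direction < 0:
            discordant += 1   # the variables move in opposite directions
        # direction == 0 is a tie in x or y and counts toward neither total
    n = len(xs)
    total_pairs = n * (n - 1) // 2
    return (concordant - discordant) / total_pairs


print(kendall_tau_a([1, 2, 3], [1, 3, 2]))  # 2 concordant, 1 discordant, 3 pairs
```

The loop visits every pair, so the cost grows quadratically with the number of observations; that is acceptable for an illustration but slow for a table with tens of thousands of rows.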
Phik (φk)
Phik (φk) is a new and practical correlation coefficient that works consistently between categorical, ordinal and interval variables, captures non-linear dependency, and reverts to the Pearson correlation coefficient in the case of a bivariate normal input distribution. Extensive documentation is available from the Phik project.
Cramér's V (φc)
Cramér's V is an association measure for nominal random variables. The coefficient ranges from 0 to 1, with 0 indicating independence and 1 indicating perfect association. The empirical estimators used for Cramér's V have been shown to be biased, even for large samples. We use the bias-corrected measure proposed by Bergsma in 2013.
First rows
| | df_index | loan_amnt | term | int_rate | installment | grade | emp_length | home_ownership | annual_inc | verification_status | loan_status | purpose | addr_state | dti | delinq_2yrs | earliest_cr_line | inq_last_6mths | open_acc | pub_rec | revol_bal | revol_util | total_acc | last_credit_pull_d | fico_average |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 0 | 0 | 5000.0 | 36 months | 10.65% | 162.87 | B | 10+ years | RENT | 24000.0 | Verified | Fully Paid | credit_card | AZ | 27.65 | 0.0 | Jan-1985 | 1.0 | 3.0 | 0.0 | 13648.0 | 83.7% | 9.0 | Sep-2016 | 737.0 |
| 1 | 1 | 2500.0 | 60 months | 15.27% | 59.83 | C | < 1 year | RENT | 30000.0 | Source Verified | Charged Off | car | GA | 1.00 | 0.0 | Apr-1999 | 5.0 | 3.0 | 0.0 | 1687.0 | 9.4% | 4.0 | Sep-2016 | 742.0 |
| 2 | 2 | 2400.0 | 36 months | 15.96% | 84.33 | C | 10+ years | RENT | 12252.0 | Not Verified | Fully Paid | small_business | IL | 8.72 | 0.0 | Nov-2001 | 2.0 | 2.0 | 0.0 | 2956.0 | 98.5% | 10.0 | Sep-2016 | 737.0 |
| 3 | 3 | 10000.0 | 36 months | 13.49% | 339.31 | C | 10+ years | RENT | 49200.0 | Source Verified | Fully Paid | other | CA | 20.00 | 0.0 | Feb-1996 | 1.0 | 10.0 | 0.0 | 5598.0 | 21% | 37.0 | Apr-2016 | 692.0 |
| 4 | 4 | 3000.0 | 60 months | 12.69% | 67.79 | B | 1 year | RENT | 80000.0 | Source Verified | Current | other | OR | 17.94 | 0.0 | Jan-1996 | 0.0 | 15.0 | 0.0 | 27783.0 | 53.9% | 38.0 | Sep-2016 | 697.0 |
| 5 | 5 | 5000.0 | 36 months | 7.90% | 156.46 | A | 3 years | RENT | 36000.0 | Source Verified | Fully Paid | wedding | AZ | 11.20 | 0.0 | Nov-2004 | 3.0 | 9.0 | 0.0 | 7963.0 | 28.3% | 12.0 | Jan-2016 | 732.0 |
| 6 | 6 | 7000.0 | 60 months | 15.96% | 170.08 | C | 8 years | RENT | 47004.0 | Not Verified | Fully Paid | debt_consolidation | NC | 23.51 | 0.0 | Jul-2005 | 1.0 | 7.0 | 0.0 | 17726.0 | 85.6% | 11.0 | Sep-2016 | 692.0 |
| 7 | 7 | 3000.0 | 36 months | 18.64% | 109.43 | E | 9 years | RENT | 48000.0 | Source Verified | Fully Paid | car | CA | 5.35 | 0.0 | Jan-2007 | 2.0 | 4.0 | 0.0 | 8221.0 | 87.5% | 4.0 | Dec-2014 | 662.0 |
| 8 | 8 | 5600.0 | 60 months | 21.28% | 152.39 | F | 4 years | OWN | 40000.0 | Source Verified | Charged Off | small_business | CA | 5.55 | 0.0 | Apr-2004 | 2.0 | 11.0 | 0.0 | 5210.0 | 32.6% | 13.0 | Sep-2016 | 677.0 |
| 9 | 9 | 5375.0 | 60 months | 12.69% | 121.45 | B | < 1 year | RENT | 15000.0 | Verified | Charged Off | other | TX | 18.08 | 0.0 | Sep-2004 | 0.0 | 2.0 | 0.0 | 9279.0 | 36.5% | 3.0 | Sep-2016 | 727.0 |
Last rows
| | df_index | loan_amnt | term | int_rate | installment | grade | emp_length | home_ownership | annual_inc | verification_status | loan_status | purpose | addr_state | dti | delinq_2yrs | earliest_cr_line | inq_last_6mths | open_acc | pub_rec | revol_bal | revol_util | total_acc | last_credit_pull_d | fico_average |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 41211 | 42439 | 11625.0 | 36 months | 15.01% | 403.07 | F | 1 year | RENT | 32500.0 | Not Verified | Does not meet the credit policy. Status:Fully Paid | small_business | NY | 0.74 | 0.0 | Dec-2006 | 4.0 | 2.0 | 0.0 | 381.0 | 63.5% | 2.0 | Sep-2010 | 647.0 |
| 41212 | 42440 | 2500.0 | 36 months | 7.43% | 77.69 | A | < 1 year | RENT | 22800.0 | Not Verified | Does not meet the credit policy. Status:Charged Off | moving | GA | 0.53 | 0.0 | Sep-1997 | 5.0 | 1.0 | 0.0 | 416.0 | 10.4% | 2.0 | Jul-2010 | 747.0 |
| 41213 | 42441 | 11050.0 | 36 months | 15.96% | 388.28 | F | 3 years | RENT | 27716.0 | Not Verified | Does not meet the credit policy. Status:Charged Off | debt_consolidation | AZ | 12.90 | 0.0 | Jun-1997 | 3.0 | 9.0 | 0.0 | 2621.0 | 51.5% | 15.0 | Sep-2016 | 647.0 |
| 41214 | 42442 | 3000.0 | 36 months | 12.49% | 100.35 | D | < 1 year | OWN | 65000.0 | Not Verified | Does not meet the credit policy. Status:Fully Paid | credit_card | NY | 14.25 | 0.0 | Jul-1999 | 8.0 | 17.0 | 0.0 | 8143.0 | 60.3% | 24.0 | Dec-2009 | 667.0 |
| 41215 | 42443 | 6000.0 | 36 months | 14.70% | 207.11 | E | 1 year | MORTGAGE | 22000.0 | Not Verified | Does not meet the credit policy. Status:Fully Paid | debt_consolidation | NH | 20.00 | 0.0 | Jun-2000 | 19.0 | 17.0 | 0.0 | 15782.0 | 36.2% | 17.0 | Sep-2016 | 667.0 |
| 41216 | 42444 | 1500.0 | 36 months | 11.86% | 49.72 | D | 5 years | RENT | 28000.0 | Not Verified | Does not meet the credit policy. Status:Fully Paid | other | FL | 14.31 | 1.0 | Feb-2006 | 1.0 | 1.0 | 0.0 | 0.0 | 0% | 2.0 | Oct-2010 | 667.0 |
| 41217 | 42445 | 3000.0 | 36 months | 8.38% | 94.54 | A | < 1 year | RENT | 20000.0 | Not Verified | Does not meet the credit policy. Status:Fully Paid | educational | NY | 6.72 | 0.0 | Dec-1998 | 9.0 | 4.0 | 0.0 | 7021.0 | 27.4% | 4.0 | Jun-2016 | 732.0 |
| 41218 | 42446 | 4500.0 | 36 months | 8.07% | 141.15 | A | < 1 year | RENT | 18240.0 | Not Verified | Does not meet the credit policy. Status:Fully Paid | other | GA | 3.29 | 0.0 | Apr-2004 | 1.0 | 1.0 | 0.0 | 0.0 | 0% | 2.0 | Oct-2013 | 737.0 |
| 41219 | 42448 | 15000.0 | 36 months | 12.17% | 499.45 | D | 1 year | MORTGAGE | 83200.0 | Not Verified | Does not meet the credit policy. Status:Fully Paid | credit_card | WI | 17.02 | 0.0 | Oct-1995 | 5.0 | 14.0 | 0.0 | 37570.0 | 59.5% | 37.0 | Apr-2015 | 712.0 |
| 41220 | 42449 | 5000.0 | 36 months | 10.91% | 163.48 | C | < 1 year | RENT | 42500.0 | Not Verified | Does not meet the credit policy. Status:Fully Paid | credit_card | FL | 1.21 | 0.0 | Aug-2005 | 1.0 | 2.0 | 0.0 | 1424.0 | 43.2% | 3.0 | Oct-2010 | 677.0 |